A FINITENESS PROPERTY OF TORSION POINTS 



MATTHEW BAKER, SU-ION IH, AND ROBERT RUMELY 

Abstract. Let k be a number field, let E/k be an elliptic curve, and let S be a finite set of 
places of k containing the archimedean places. We prove that if a G E(k) is nontorsion, then 
there are only finitely many torsion points £ G E(k)t OIS which are S'-integral with respect to 
a. We also prove an analogue of this for the multiplicative group, and formulate conjectural 
generalizations for abelian varieties and dynamical systems. 



1. Introduction 

Let k be a number field, with ring of integers Ok and algebraic closure k, and let E/k be 
an elliptic curve. Let £/Spec(0fc) be a model of E, and let S be a finite set of places of k 
containing the archimedean places. In this paper we will prove: 

Theorem 1.1. If a £ E(k) is nontorsion [that is, has canonical height h(a) > 0), then 
there are only finitely many torsion points £ G E(k) t0 rs which are S-integral with respect to 
a. 

By S'-integrality we mean that the Zariski closures of £ and a in the model £ /Spec(Ofc) do 
not meet outside fibres above S. Since any two models are isomorphic outside a finite set of 
places, the finiteness property is independent of the set S and the model S. 

We will also prove an analogue of Theorem II. II for the multiplicative group ( Theorem 12.11 
below). Theorems 11.11 and 12.11 are analogues for non-compact varieties, where it is most 
natural to look at integral points, of the Manin-Mumford conjecture (first proved by Raynaud 
|Ra83]) . 

The ingredients of the proof of Theorem 11.11 are a strong form of equidistribution for 
torsion points at all places v, properties of local height functions, and David/Hirata-Kohno's 
theorem on linear forms in elliptic logarithms. In outline, the proof is as follows. By base 
change, one reduces to the case where a G E(k). Given a place v of k, let k v be the algebraic 
closure of the completion k v , and let : E(k v ) — > K be an appropriately normalized Neron- 
Tate canonical local height. On the one hand, elementary properties of heights show that 
for any torsion point £ n , one has 

(1) h(a) = E _ • 
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On the other hand, if {£ n } is a sequence of distinct torsion points which are S-integral with 
respect to a, then for each v 



By the integrality hypothesis, the outer sum in (JTJ can be restricted to v G S, allowing the 
limit and the sum to be interchanged. This gives h(a) = 0, contradicting the assumption 
that a is nontorsion. 

Examples show that the conclusion of Theorem II .11 is false if a is a torsion point, and that 
it can fail if is merely a sequence of small points (that is, a sequence of points with 
h(£ n ) — ► 0). In particular, Theorem 11.11 cannot be strengthened to a theorem of Bogomolov 
type. 

Theorem II .11 is the first known case of general conjectures by the second author (as refined 
by J. Silverman and S. Zhang) concerning abelian varieties and dynamical systems. Assume 
as before that k is a number field, and let S be a finite set of places of k containing the 
archimedean places. Let Ok,s be the ring of S'-integers of k. 

Conjecture 1.2. (Ih) 

Let A/k be an abelian variety, and let As/Spec(Ok,s) be a model of A. Let D be an 
effective divisor on A, defined over k, at least one of whose irreducible components is not 
the translate of an abelian subvariety by a torsion point, and let D be its Zariski closure in 
As- Then the set A DjS (Z) tors , consisting of all torsion points of A{k) whose closure in As 
is disjoint from D, is not Zariski dense in A. 

Conjecture 1.3. (Ih) 

Let R(x) G k(x) be a rational function of degree at least 2, and consider the dynamical 
system associated to the rational map -R* : P 1 — > P 1 . Let a G P 1 (/c) be non-preperiodic for 
R*. Then there are only finitely many pre-periodic points £ G P X (A;) which are S -integral with 
respect to a, i.e. whose Zariski closures in F 1 /Spec(Ok,s) do not meet the Zariski closure of 
a. 

Theorem ll.il in addition to being the one-dimensional case of Conjecture II .21 is equivalent 
to Conjecture 11.31 for Lattes maps. That is, if E/k is an elliptic curve, let R G k{x) be the 
degree 4 map on the ^-coordinate corresponding to the doubling map on E, so that the 
following diagram commutes: 



Then (3 G E(k) is a torsion point if and only x{(3) is preperiodic for R*. 

We will also prove Conjecture 11.31 for the map R(x) = x 2 , where the preperiodic points 
are 0, oo, and the roots of unity. In that case, the assertion is that there are only finitely 
many roots of unity which are S"-integral with respect to a given non-root of unity a G k 
(see Theorem 12. lj) . Conjecture II. HI for Chebyshev maps can be deduced by similar methods, 
though we do not do so here. 
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The motivation for Conjecture 11.21 is the following analogy between diophantine theorems 
over k and k, and over Ok and Z (the ring of all algebraic integers). Let A/k be an abelian 
variety, and let X be a non-torsion subvariety of A (that is, X is not the translate of an 
abelian subvariety by a torsion point). Recall that the Mordell-Lang Conjecture (proved by 
Faltings) says that A{k)C\X is not Zariski dense in X; while the Manin-Mumford Conjecture 
(first proved by Raynaud) says that A(k) tOTS DX is not Zariski dense in X. Likewise, Lang's 
conjecture (also proved by Faltings) says that if D is an effective ample divisor on A, then 
the set AoiOk) of Ofc-integral points of A not meeting supp(D) is finite. Note that A is 
compact, whereas Ad = A\supp(-D) is noncompact. 



Type of variety; Type of rationality 


k 


k 


Compact; k, /c-rationality 


Mordell-Lang 
Conjecture 


Manin-Mumford 
Conjecture 


Noncompact; Ok, Z-rationality 


Lang's 
Conjecture 


Ih's 

Conjecture 11.21 



Conjecture 11.31 is motivated by Conjecture 11.21 and the familiar analogy between torsion 
points of abelian varieties and preperiodic points of rational maps. 

The paper is divided into two sections. In the first, we prove Conjecture 11.31 for the 
dynamical system R(x) = x 2 . In the second, we prove Conjecture for elliptic curves. 

Throughout the paper, we will use the following notation. For each place v of k, let k v 
be the completion of k at v and let \x\ v be the normalized absolute value which coincides 
with the modulus of additive Haar measure on k v . If v is archimedean and k v = R, then 
\x\ v = \x\, while if k v = C then \x\ v = \x\ 2 . If v is nonarchimedean and lies over the rational 
prime p, then \p\ v = p~^ kv '-^. For ^ a G k, the product formula reads 

] J \at\v = 1 • 

V 

If k v is an algebraic closure of k v , there is a unique extension of \x\ v to k v , also denoted 
\x\ v . Given a finite extension L/k, for each place w of L we have the normalized absolute 
value \x\ w on L w . If we embed L w in k v , then | for each x e L w . Write 

log(x) for the natural logarithm of x. Given (3 G L and a place v of k, as a ranges over all 
embeddings of L into k v fixing k we have 

(3) £ log(kC9)|„) = E lo s(l/5U)- 

The absolute Weil height of a G k (also called the naive height) is defined to be 

M«) = [ fe 1 q] £max(0,log(|a|„)) . 

It is well known that for a G Q, h(a) is independent of the field k containing Q(a) used to 
compute it, so h extends to a function on Q. Furthermore h(a) > 0, with h(a) = if and 
only if a = or a is a root of unity. 
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2. Ih's conjecture for the dynamical system R(x) = x 2 . 



2.1. The finiteness theorem. Let S be a finite set of places of k containing the archimedean 
places. Given a, (3 £ k, view them as points in P 1 (fc) and let cl(a),cl(/3) be their Zariski 
closures in P 1 /Spec((9fc)- By definition, (3 is S-integral relative to a if cl(/3) does not meet 
cl(a) outside S. Thus, (3 is ^-integral relative to a if and only if for each place v of k not in 
S, and each pair of embeddings a : fe(/3) ^ k v , t : k(a) we have ||o"(/3), r(a)||„ = 1 

under the spherical metric on P 1 (/c ?; ). Equivalently, for all a, r 

f \a(/3)-r(a)\ v >l if |r(a)|„ < 1 , 
1 W)|«<1 if |r(a)|„>l . 

Theorem 2.1. Ze£ k be a number field, and let S be a finite set of places of k containing 
all the archimedean places. Fix a £ k with h(a) > 0; that is, a is not or a root of unity. 
Then there are only finitely many roots of unity in k which are S-integral with respect to a. 

Before giving the proof, we note some examples which limit possible strengthenings of the 
theorem. 

A) The hypothesis h(a) > is necessary: 

If a = 0, take k = Q. Then each root of unity ( n is integral with respect to a at all finite 
places. If a = 1, then each root of unity of composite order is integral with respect to a at 
all finite places. 

If a — Cn is an N th root of unity with N > 1, take k = Q(Cat)- If Cm is a primitive m th 
root of unity with (m, N) = 1 and m > 1, then Ca^C™ is a primitive mN th root of unity 
whose order divisible by at least two primes. This means 1 — Ca^C?™ is a unit, so Cn — Cm is 
also a unit. Hence ( m is integral with respect to a at all finite places. 

B) When h(a) > 0, one can ask if the theorem could be strengthened to a result of 
Bogomolov type: is there a number B = B(a) > such that there are only finitely many 
points (3 £ k with h(f3) < B which are S-integral with respect to a? That is, could finiteness 
for roots of unity be strengthened to finiteness for small points? 

The following example shows this is not possible. Take k = Q, a = 2, and S = {oo}. For 
each n, let (3 n be a root of the polynomial 

f n (x) = x 2 \x-2)-l. 

Here f n (x + 1) is Eisenstein with respect to the prime p = 2, so f n (x) is irreducible over 
Q. Note that each (3 n is a unit. By Rouche's theorem, (3 n has one conjugate very near 2 
and the rest of its conjugates very close to the unit circle; this can be used to show that 
lim^oo h((3 n ) = 0. Finally, (3 n — 2 is also a unit, so (3 n is integral with respect to 2 at all 
finite places. 

Proof, of Theorem 12.11 

By replacing k with k(a), and S with the set of places Sk( a ) lying over S, we are reduced 
to proving the theorem when a £ A;. Indeed, if ( is a root of unity which is ^-integral with 
respect to a over k, then each /c-conjugate of ( is S^^-integral with respect to a over k(a). 
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Suppose a G k, and that there are infinitely many distinct roots of unity {£„} which are 
S- integral with respect to a. For each n, we will evaluate the sum 

(4) ^=[^Tq]E E 

in two different ways. On the one hand, an application of the product formula will show that 
each A n = 0. On the other hand, by applying the integrality hypothesis, Baker's theorem 
on linear forms in logarithms, and a strong form of equidistribution for roots of unity, we 
will show that lim^oo A n = h(a) > 0. This contradiction will give the desired result. 

The details are as follows. First, using (J3J), formula (jlj) can be rewritten as 

A " = wcy^i £ log(lc "" aU) ' 

W Of fc(fn) 

Since a is not a root of unity, the product formula gives A n = 0. 

Next, take v ^ S. If \a\ v > 1 then by the ultrametric inequality, for each a : k(( n )/k "—>■ k v 
we have \&(( n ) — a\ v = \ce\ v . On the other hand, if \a\ v < 1, the integrality hypothesis gives 
\a(( n ) — oz\ v = 1. It follows that for each v ^ S 

(5) [fc(Cn ) : Q] E _ log(k(Cn) - «W = ^max(0,log(|a| w )), 
so that 

(6) A. = E^^] £ 'og(k(C„)- a |„) 



^^^max(0,log(|a|„)) . 



/>• : 'i. I 1 

Now let n — > oo in (jSJ. Since S 1 is finite, we can interchange the limit and the sum over 
v G S, obtaining 

= E(i^ WTTQ] E Mk(Cn) - «w) 

(7) + i^^j^max^^ogda^) . 

We will now show that for each v G 5, 

1 ]_ 

( 8 ) lim r; 77 \ Hi E lo S(k(Cn) - = r/ 01 max(0, log(|a|„) . 

Inserting this in (JJJ) gives h(a) = 0, a contradiction. 

For each nonarchimedean v G S, (jHJ) is trivial if |a|„ > 1 or < 1. In the first case 
\ a {Cn) — oi\ v = \oc\ v for all n and all a, and in the second case |c(Cn) ~~ a \v — 1 for all n and 
all a. Hence we can assume that laL = 1. 
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Lemma 2.2. Let v be nonarchimedean, and suppose \a\ v = 1. Then 

(A) There is a bound M(a) > such that \( — a\ v > M(a) for all roots of unity ( G k v . 

(B) For each < r < 1, there are only finitely many roots of unity ( G k v with \( — a\ v < r. 

Proof. Since a is not a root of unity, (A) follows immediately from (B). For (B), note that 
if ( and (' are roots of unity with \( — a\ v < r and \(' — a\ v < r, then \( — ( f \ v < r and so 
(" = C _1 C is a ro °t °f unity with |1 — < r. There are only finitely many such Indeed, 
if p is the rational prime under v, the only roots of unity £ G k v with |1 — £\ v < 1 are ones with 
order p n for some n. If £ is a primitive p n -th. root of unity, then |1 — £|„ = p-l k -» : Qp]/p n (p- 1 ) 
sol>r>|l — £\ v for only finitely many n. □ 



Assuming v is nonarchimedean and = 1, let M(a) be as in the Lemma. Fix < r < 1, 
and let N(r) be the number of roots of unity in k v with \( — a\ v < r. For each ( n and each 
cr : k(( n )/k — > k v , we have |cr(£ n ) — a\ v < 1, so 



a l^wura E lo 8(k(c» 



n— *oo 
1 



HQ : 



a\ 

cr:k(Cn)/k<-*k v 



> lim 1 ^ (([ fc (Cn) : fe] - N{r)) ■ log(r) + iV(r) • log(M(a))) 



log(r) 



[A;: 

Since r < 1 is arbitrary, the limit in (jHJ) is 0, verifying (jHJ) in this case. 

Now suppose v is archimedean. To simplify notation, view fcasa subfield of C and identify 
k v with C. (Thus, the way k is embedded depends on the choice of v). 
By Jensen's formula (see |Co86j . p. 280) applied to f(z) = z — a, 

(9) -L riog(\e w -a\)dd = max(0,log(|a|)) . 

^ Jo 

Here \x\ can be replaced by \x\ v . 

The Gal(A;/A;)-conjugates of roots of unity equidistribute in the unit circle. We will give 
a direct proof of this below, but we note that it also follows from generalizations of Bilu's 
theorem, for example the equidistribution theorem for polynomial dynamical systems given 
in Baker-Hsia ( BHpp|). The Baker-Hsia theorem implies that if fi n is the discrete measure 



where 5p(x) is the Dirac measure with mass 1 at P, then the [i n converge weakly to the 
Haar measure [i = (l/2n)d8 on the unit circle. 

If \a\ v > 1 or \a\ v < 1 then log(|z — a\ v ) is continuous on the unit circle. In these cases, 
(jHJ) follows from (jHJ) and weak convergence. If \a\ v = 1 then log(|z — a\ v ) is not continuous 
on \z\ = 1 and weak convergence is not enough to give J< z < =1 log(|z — a|) d/i n (z) — ► 0. There 
could be a problem if some conjugate were extremely close to a, or if many conjugates 
clustered near a. 
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The first problem is solved by A. Baker's theorem on lower bounds for linear forms in 
logarithms (see Baker |Ba75j . Theorem 3.1, p. 22). We are assuming that \oc\ v = 1, and a 
is not a root of unity. Fix a branch of log with log(z) = log(|^|) + id, —ti < 9 < tt, and 
write log (a) = i9q. For another branch, log(l) = 27ci. The following is a special case of 
Baker's theorem. (In his statement of the theorem, Baker uses an exponential height having 
bounded ratio with H(f3) = e h{f3) .) 

Proposition 2.3. (A. Baker) There is a constant C = C(a) > such that for each (3 = 
a/N e Q 

| i0o-/3 -2m\ > e - c - max(1 ' h(/3)) , 
where h(/3) = log(max(|a|, \N\)) is the absolute height of (3. 



The second problem is settled by a strong form of equidistribution for roots of unity, proved 
in £ 12.21 below. It says that for any < 7 < 1, the conjugates of the ( n are asymptotically 
equidistributed in arcs of length [k(( n ) : A;]" 7 . Note that weak convergence is equivalent to 
equidistribution in arcs of fixed length. 

Proposition 2.4. (Strong Equidistribution) Let k C C be a number field. Then the 
Gal(k/ k) -conjugates of the roots of unity in k (viewed as embedded in C) are strongly equidis- 
tributed in the unit circle, in the following sense. 

Given an arc I in the unit circle, write //(/) = j- length (/) for its normalized Haar mea- 
sure. If £ G k is a root of unity, put 

N(CI) = #MC) e I : a E Gal(fc/fc)} . 

Fix < 7 < 1. Then for all roots of unity ( and all I , 

(10) y^l^r = + O,([k(0 : kD . 

We remark that a strong equidistribution theorem for points of small height with respect 
to an arbitrary dynamical systems on P 1 has recently been proved by C. Favre and J. Rivera- 
Letelier ( |FRLpp| , Theoreme 6). 

Assuming Proposition 12.41 we will now complete the proof of Theorem 12.11 by showing 
that JSJ) holds for archimedean v when \a\ v = 1. 

Let fi = (l/2n)d8 be the normalized Haar measure on the unit circle, and for each n, put 

L KSUJ 1 a:fc(Cn)/fc^C 

Then \i n is supported on the unit circle and the \x n converge weakly to //. We must show 
that 

/ \og{\z-a\)dii n {z) = Vlog(|a(Cn) - a\) -> 0. 

J\z\=i FlUJ • K \ a 

The idea is to break the sum into three parts: the terms nearest a, which can be treated 
by Baker's theorem; the other terms in a small neighborhood of a, which can be dealt with 
by strong equidistribution; and the rest, which can be handled by weak convergence. 
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Fix < e < 1. We will show that for all sufficiently large n, 
(11) I / log{\z-a\)dv n {z)\ < 6e . 

J\z\=l 

Note that J Q £ \og(t/e) dt = —e. For the remainder of the proof, we restrict to \z\ = 1; write 
a = e te ° where — 7r < 6$ < it, and write z = e ld where 6q — tt < 9 < 6q + tt. Define 

larg Q£ (z) = min(0, log(|6> - B Q \/e)) . 

Then there is a continuous function g a)£ (z) on \z\ = 1 for which log(|z — a\) = larg Q£ (;z) + 
g at£ (z). Recalling that JLi =1 log(|z — a\) dfi(z) = 0, we have 



f f e d6 £ 

z\=l J\z\=l JO 

By weak convergence, it follows that for all sufficiently large n, 



2-7T 7T 



(12) | / g a , £ (z) dn n {z)\ < e . 

J\z\=l 

To obtain (fTTj) . it will suffice to show that for all sufficiently large n, 

larg (z) dfM n (z)\ < 5e . 



\z\=l 

For each interval [c,d] let I a ([c,d}) be the arc {ae 2mt : t 6 [c, d]}. Noting that larg Qe (z) is 
supported on I a ([—e,e]), put D = D n = \[k(( n ) : k} 1 ^ 2 ] and divide I a ([—e,e]) into 2D equal 
subarcs. Taking 7 = 2/3 in Proposition 12 .41 it follows that if n is sufficiently large, each such 
subarc contains at most 2e[k(( n ) : k] 1 / 2 conjugates of ( n . 

First consider the union of the two central subarcs, I a ([—e/D,e/D}). Let N be the order 
of Cn- Let cxo(Cn) = e 2ma ^ N be the conjugate of ( n closest to a = e l9 °. We can assume that 
\ a /N\ < 1, which implies that that h(a/N) = max(log(|a|), log(iV)) = log(iV). By Baker's 
theorem, 

\2n(a/N) -9 \ > e -ew(i,]og(JV)) _ 
Hence if n is sufficiently large, 

larg Q > (Cn)) > -Clog(JV)-log(e) > -Clog(N) . 
Since there are at most ie[k(C n ) '■ k] 1 ^ 2 conjugates of ( n in I a ([—e/D,e/D)), 

> / larg (\z - a\) dfin(z) > -4 S,} ^},^ ■ 

Jl a ([-e/D,e/D]) HCn) ■ k} 1 / 2 

Note that [k(( n ) : k] > [Q(( n ) : Q]/[k : Q] = (p(N)/[k : Q]. For all large N, <p(N) > N 1 ^ 2 , so 
there is a constant B such that [k(Cn) '■ k} 1 / 2 > BN l l A . Thus for all sufficiently large n, 



(13) I / \og(\z - a\) dn n (z)\ < e. 

Jl a ([-8/D,8/D]) 

Finally, consider the remaining subarcs. For £ — 1, . . . , D — 1, if 

z G I a ([£e/D, (£ + l)e/D\) or z e I a ([-£e/D, -(£ + l)e/D\) 
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then > larg (z) > \og(£/D). As before, by Proposition 12.41 for sufficiently large n, each 
subarc contains at most 2[k(( n ) : k](e/D) conjugates of ( n . It follows that 

> / larg a e (» d/j, n (z) 

Jl a ([-S,S])\I a ([-S/D,S/D}) 

log(( D )/£) ' D 

(14) > 4 f \og(t/e) dt = -4e . 

Jo 

Combining ()12)) , (jlHj) , and ()14)) gives (jllj) , which completes the proof of Theorem 12.11 □ 

In the course of writing this paper, the authors learned of several results related to Theorem 
I2.1[ some of which imply it in special cases. 

A. Bang's theorem jK1886j (1886) says that if a ^ ±1 is a nonzero rational number, then 
for all sufficiently large integers n there is a prime p such that the order of a modulo p 
is exactly n. This can be rephrased as saying that for all sufficiently large n, there exists 
a primitive n-th root of unity ( n and a nonzero prime ideal p of Z[£ n ] such that a = C, n 
(mod p). Since all primitive n-th roots are conjugate over Q, this implies Theorem 12.11 in 
the case a G Q. A. Schinzel [Sc74| gave an effective generalization of Bang's theorem to 
arbitrary number fields; Schinzel's theorem implies Theorem 12. II for number fields k which 
are linearly disjoint from the maximal cyclotomic field Q ab , and a G k. 

J. Silverman |Si95j has shown that if a G Q is an algebraic unit which is not a root of 
unity, there are only finitely many m for which $ m (a;) is a unit, where $ m (x) is the m-th 
cyclotomic polynomial. In fact, if d = [Q(ct) : Q] he shows there is an absolute, effectively 
computable constant C such that the number of such m's is at most 

q . ^1+0.7/ log(log(d)) _ 

In the case when a is a unit, this yields Theorem 12.11 in the same situations as Schinzel's 
theorem. 

G. Everest and T. Ward ( EW99], Lemma 1.10) show that if F(x) G Z[ x] is monic and 
irreducible, with roots a±, . . . , ad, and if F(x) is not a constant multiple of x or a cyclotomic 
polynomial <3> m (x), then the quantity A n (F) = Of=i C^? — 1) satisfies 

(15) lim -log(A n (F)) = m(F) > 0, 

where m(F) = deg(F) • h(cti) is the logarithm of the Mahler measure of F(x). When k = Q, 
and a = a\ is an algebraic integer, the product formula tells us that n^ofQ \A n (F))\ v = 1, 
so for all large n there must be some nonarchimedean v and some such that |a" — l\ v — 1, 
and this in turn means there is some n-th root of unity ( with \oii — C|« < 1- However, this 
is not strong enough to give Theorem 12. II because (a) ( might not be primitive, and (b) the 
primitive n-th roots of unity might not all be conjugate to one another over Q(a). 

2.2. Strong equidistribution for roots of unity. We will now prove Proposition I2.4| the 
strong equidistribution theorem for roots of unity. At least when k — Q, the result is well 
known to analytic number theorists, but we do not know a reference in the literature. 
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The proof rests on the following lemma, for which we thank Carl Pomerance. Let <p(N) 
denote Euler's function and let d(N) = J2 m \N 1 ^ e ^ ne divisor function. We write A(m) for 
the number of primes dividing m, and use 9{x) to denote a quantity satisfying — x < 6{x) < x. 

Lemma 2.5. (Pomerance) Fix an integer Q > 1 and an integer b coprime to Q. Then for 
each integer N > 1 divisible by Q and each interval (c, d] C M, 

#{a G (c, d] n Z : (a, JV) = 1, a = 6 (mod Q)} = ffi^ (d - c) + 0(d(iV)) . 

Remark 16. The main content of the lemma is that the error depends only on AT, and not 
on Q or (c, d] . 

Proof. Let pi, . . . ,p r be the primes dividing iV but not Q. (If there are no such primes, 
take p\---p r = 1 in the argument below). Take bo G Z with bo = b (mod Q), &o 
I ) (mod p\ . . .p r ). Then 

{a G (c, d] n Z : a = 6 (mod Q), (a, AT) = 1} 

= {aG(c,d]nZ: Q\a - b , pt, . . . ,p r J(a - b } 

For each positive integer m dividing p 1 ■ ■ -p r put r mjbj g(c, d) = #{a G (c, d] HZ : Qm\a — b }. 
Then 

r ro ,6,Q(c,cZ) = [(d-bo)/Qm\-[(c-b )/Qm\ = _L(d- c ) +0(1) . 

Carrying out inclusion/exclusion relative to the primes pi, . . . ,p r we have 
#{a G (c, d] n Z : a = b (mod Q), (a, iV) = 1} 

= Yl (-±) Xim) r m , b , Q (c,d) 

m\p l ---p r 



Lf[(i-i).(d-c) + e(d( Pl --- Pr )) 

v . =1 Pi 

(d - c) + 0(d(AO) • 



□ 



Ncp(Q) 

Proof, of Proposition 12.41 

Let denote a primitive N th root of unity. There are only finitely many subfields of k, 
so there are only finitely subfields of the form k^ = fcnQ(Cjv) for some N. For each N there 
is a minimal Q for which k^ = kQ, and then Q(Cq) C Q(Ctv) so Q| A/". We will call Q = Qn 
the cyclotomic conductor of (n relative to k, and write Tjy = [Q(Cqjv) : ^v]- 

As Q(Cat) is Galois over Q, it is linearly disjoint from k over k N , and Gal(/c(£jv)/&) = 
Gal(Q(( N )/k N ). Since fcjv C Q(Cqjv) C Q((n), the conjugates of (jv over /c are a union of 
Tjy sets of the form 

{e 2ma/N . Q _ ( a> jy) = 1} , 



for certain numbers foj coprime to Q 
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Let I be an arc of the unit circle corresponding to an angular interval (#1, 9q\. Put (c, d] = 
6 2 ]. Then e 2nia/N G I if and only if a G (c, d). By Lemma 1231 

ip(N) N 

(17) n Kn ,i) = r^.^L.-^-ft) + epww). 

Recall that for any 5 > 0, if N is sufficiently large then d{N) < N 5 and <p(iV) > N l ~ s 
(see Hardy and Wright FfW71j, Theorem 315, p. 260, and Theorem 327, p. 267). Take 6 such 
that < 25 < 1 — 7. Noting that [fc(Civ) : — T N <p(N)/{p(Q N ), and that ^p{Qn) is bounded 
independent of iV, (fTTj) gives 

(18) ^4 = M^) + 7 (iV^). 

Since [&(Civ) : k] < N, the error bound in (fTHj) holds with N replaced by [fc(Civ) : k]. Since 
[A;(Cjv) : /c]/iV 7 — > 00 as iV — > 00, adjoining or removing endpoints of / will not affect the 
form of the estimate, so (jlOJ) applies to all intervals. □ 

3. Ih's conjecture for elliptic curves. 

3.1. The finiteness theorem. Let k be a number field, and let E/k be an elliptic curve. 
We can assume E is defined by a Weierstrass equation 

(19) y 2 + a\xy + a 3 y = x 3 + a 2 x 2 + a 4 x + a 6 

with coefficients in Ok- More precisely, E is the hypersurface in P 2 /Spec(fc) defined by the 
homogenization of (|T9"|) . Let A be its discriminant. 

Given a nonarchimedean place v of k and points a, (3 G E(k), we will say that f3 is 
integral with respect to a at v if the Zariski closures cl(/3) and cl(a) do not meet in the 
model £„/ Spec ((!?<,) defined by the homogenization of (fT9"j) . Equivalently, if \\z, w\\ v is the 
restriction of the spherical metric on F 2 (k v ) to E(k v ) (see |Ru89| . §1.1), then for each pair 
of embeddings a, r : k/k ! — > k v , 

\\a(f3),r(a)\\ v = 1 . 

If S is a set of places of k containing all the archimedean places, we say f3 is S"-integral with 
respect to a if (3 is integral with respect to a at each v S. 
Write h(a) for the canonical height on E(k), defined by 

h(a) = \ lim 1m*([2») = \ lim ^^([2 n ]«) , 

Z n— >oo 4 O n— »oo 4 

where /ipi (resp. /ip2) is the naive height on ¥ 1 (k) (resp. P 2 (A;)), and [m] is multiplication by 
m on E(k). (For a discussion of /i(a) and its properties, see [Si86| . pp. 227-231 and 365-366; 
or |Si99j . §VI.) Recall that h(a) > 0, that h([m]a) = m 2 h(a) for all m, and that h(a) = if 
and only if a G E(k)t OTS - From these facts it follows (as is well known) that if £ G E(k) tOTS , 
then 

(20) h(a) = h{a-£) . 

There is also a decomposition of h(a) as a sum of local terms. For each place v of k, let 
X V (P) be the local Neron-Tate height function on E(k v ). For compatibility with our absolute 
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values we normalize X V (P) so that X V (P) = [k v : Q p ] ■ X V) sn(P), where X V) sn(P) is the local 
Neron-Tate height defined in Silverman ( |Si86| . p. 365). For each 0^«G E{k) 

(2D = I^Q] E A » 

1 ^ v oi k 

(see |Si86j . Theorem 18.2, p. 365). Note that only finitely many terms in the sum are nonzero. 

If L/k is a finite extension, for each place w of L there is a normalized local Neron-Tate 
height X W (P) on E{L W ). If we fix an isomorphism L w = k v , then for all P G E(k v ), 

(22) A W (P) = [L w : k v )X v (P) . 

It follows that if j3 G E(L), then for each place t> of fc, as cr runs over all embeddings of L 
into k v fixing k, 

(23) £ X v (a(j3)) = J2 X M ■ 

We will use the following explicit formulas. 

Proposition 3.1. Let k be a number field, and let E/k be an elliptic curve. Letv be a place 
ofk. 

A) If v is archimedean, fix an an isomorphism E(k v ) = C/A for an appropriate lattice 
A C C. Let a(z,A) be the Weierstrass a-function, let A(A) = # 2 (A) 3 - 27# 3 (A) 2 be the 
discriminant of A, and let 77 : C — > R be the M.-linearized period map associated to the 
Weierstrass (-function ((z,A). If P e E(k v ) corresponds to z G C, then 

X V (P) = -log(|A(A) 1 /i2 e -^/V(z,A)|J . 

Furthermore, if n v (z) is the additive Haar measure on E(k v ) which gives E(k v ) = C/A 
total mass 1, then 



X v (z) djjL v {z) = . 

'E(k v ) 

B) If v is nonarchimedean and E has split multiplicative reduction at v (so E is k v ~ 
isomorphic to a Tate curve), fix a Tate isomorphism E(k v ) = k* /q z where q G k v satisfies 
\q\ v = \l/j(E)\ v < 1. Let B 2 (x) = x 2 — x + | be the second Bernoulli polynomial, and put 

X v (x) = |-B 2 ( ord ^, • (— log(|g|„)). If P E E(k v ) corresponds to z G k* , with z chosen so 
that \q\ v < \z\ v < 1, then 

X V (P) = -log(|l-z|„) + A„(ord„(z)) . 

Furthermore, if fi v is the Haar measure dx/ord v (q) giving the loop K/(Z ■ ord v (q)) total 
mass 1, then 

rord„(<j) 

X v (x) dfi v (x) = . 



C) If v is nonarchimedean and E has good reduction at v, let w\\ v be the spherical 
metric on E(k v ) induced by a projective embedding E •—>■ P 2 corresponding to a minimal 
Weierstrass model for E at v. Then for each P G E v (k v ) 

x v (p) = -io g (||p,oy . 
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Proof. This is a summary of results in ( |Si99j . §VI); see in particular Theorem 1.1, p. 455; 
Theorem 3.2, p. 466; Theorem 3.3, p. 468; and Theorem 4.1, p. 470. □ 



We now come to Ih's conjecture for elliptic curves. The following is a restatement of 
Theorem II. II in the Introduction. 

Theorem 3.2. Let E/k be an elliptic curve, and let S be a finite set of places ofk, containing 
all the archimedean places. Let a G E{k) be a nontorsion point, i.e., a point with h(a) > 0. 
Then there are only finitely many £ G E(k) t0 rs which are S-integral with respect to a. 

Again there are limitations to possible strengthenings of the theorem: 

A) As noted by Silverman, it is necessary that a be nontorsion. If a = and S is the set of 
archimedean places, then by Cassels' generalization of the Lutz-Nagell theorem (Proposition 
13.51 below), each torsion point whose order is divisible by at least two primes is ^-integral 
with respect to a. 

Similarly, if a is a torsion point of order iV > 1, let S contain all places of bad reduction 
for E. Then for each q coprime to N, all g-torsion points are .S-integral with respect to a. 

B) When h(a) > 0, Zhang has pointed out that Theorem 13.21 cannot be strengthened 
to a result of Bogomolov type. A result of E. Ullmo ( |U95j . Theorem 2.4) shows that for 
each e > 0, there are infinitely many points (3 G E(k) with h(j3) < e which are integral with 
respect to a. 

Proof. The argument is similar to the proof of Theorem I2.1| but requires more machinery. 
We begin with some reductions. 

First, after replacing k by k(a), and S by the set Sk( a ) of places lying over S, we can 
assume that a G k. 

Second, after replacing k by a finite extension K/k, and replacing S with the set Sk of 
places of K lying above places in S, we can assume that E has semi-stable reduction. Thus 
we can assume without loss of generality that for nonarchimedean v, either E has good 
reduction, or E is fc„-isomorphic to a Tate curve. 

Third, after enlarging S if necessary, we can assume that S contains all v for which 
|A|„ 7^ 1. In particular, we can assume that the model of E defined by (fTU|) has good 
reduction for all v $L S. 

We claim that if £ n G E(k) tors is any torsion point, then 
(24) Ma) = M ^j: £ (60). 

To see this, let L be the Galois closure of k(£ n )/k. By (j^Uj) and (|2"T|). for each conjugate 

h(a) = h(a - <r(( n )) = 1 ^ X w (a - a(£ n )) . 

w of L 

Averaging over all embeddings o : L > k, fixing an embedding k ■=— > k v for each place v of 
K, using (J22j) . and noting that there are only finitely many nonzero terms in each sum, we 
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have 

1 1 

M«) = t^t E tttq] E a -(« - 

1 1 a:L/k<->k 1 w of L 

= r^QjE E prE^^i-^-o) 

« of k a-.L/k^kv w\v 

= tlthT, E 

« of fc cr:L/k^k v 

Since each conjugate cr(£ n ) occurs [L : fc(£ n )] times in the final inner sum, this is equivalent 
to (|2IJ). 

Suppose there were an infinite sequence of torsion points {£„} which were S'-integral with 
respect to a. 

If v S, our initial reductions assure that E has good reduction at v. By Proposition 
13. 1I C and the integrality hypothesis, X v (a — cr(£ n )) = for each n and a. It follows that 

(25) H (a) = j: M L 71 - y: *.(<.-*(«>. 

In the following two subsections, we will show that for each v G S, 

(26) Km * . Q] E A,(a-a(a)) = 0. 

\ a:k(i n )/k<-^k v J 

This will complete the proof of Theorem 13.21 for then, combining (1251) and (|2f)jl and letting 
n — ► oo in ([25)1 . we would have /z(a) = 0, contradicting the assumption that a is nontorsion. 

3.1.1. The Archimedean Case: Let v be an archimedean place of k. To simplify notation 
we view k as embedded in C and fix an isomorphism of k v with C. Thus, the way k is 
embedded depends on the choice of v. 

To prove ()26j) we will need two facts: David/Hirata-Kohno's theorem on linear forms in 
elliptic logarithms, and a strong form of equidistribution for torsion points. 

The following is a special case of QDHK02 , Theorem 1, p. 31): 

Proposition 3.3. (David/Hirata-Kohno) 

Let E/k be an elliptic curve defined over a number field k C C. Fix an isomorphism 
9 : C/A = E(C) for an appropriate lattice A C C. Let uj\,U2 be generators for A. Fix a 
non-torsion point a G E{k) and let a G C be such that 9 (a mod A) = a. 

Then there is a constant C = C(a) > such that for all rational numbers £i/N,£ 2 /N with 

e 1 ,e 2 ,Nez, 

|a- (1^ + 1^)1 > e- Cmax ^)) . 

By Ullmo's theorem ((D98J, the Galois conjugates of the £ n are equidistributed in E(C). 
As we will see, they are in fact strongly equidistributed, in a sense analogous to that in 
Proposition 12.41 
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If £ £ E(k) tOTS , write Gal(k/k) ■ £ for the orbit {cr(£) : a G Gal(A;/A;)}. For each set 
U C E(C), write 

N(£,U) = #((Gal(fc/fc) ■ n U) . 
Let S C C be a bounded, convex, centrally symmetric set with in its interior. For each 
a G C and < r G K, write S(a,r) = {a + : z G 5}. For example, if 5 = 5(0, 1) then 
S(a, r) = B(a, r). 

Let A C C be a lattice such that 5(C) = C/A. Let r = r (<S,A) > be the largest 
number such that «S(a, r) injects into C/A = 5(C) under the natural projection for all a G C 
and all < r < r . Write <S^(a, r) for the image of S(a,r) in 5(C). 

Proposition 3.4. (Strong Equidistribution) Let k G C be a number field, and let E/k be 

an elliptic curve. Then the Ga\(k/k)- conjugates of the torsion points in E(k) are strongly 
equidistributed in 5(C) in the following sense: 

Let \i be the additive Haar measure on 5(C) with total mass 1. Fix 7 with < 7 < 1/2, 
and fix a bounded, convex, centrally symmetric set S with in its interior. Then for each r 
such that S(a,r) injects into 5(C), and for all £ G E(k) tOTS , 

N(£,S E {a,r)) n(\uc\ n-i\ 

^ . fci = MSMa,r)) + O([A;(0 : fc] 7 ) 

where the implied constant depends only on S, E, and 7. 
The proof will be given in below. 



We can now complete the proof of ()26|) in the archimedean case. The argument is similar 
to the one in the proof of Theorem 12.11 By Ullmo's theorem (|JJ98 ), or by Proposition 13.41 
when S has the shape of a period parallelogram (so 5 can be tiled with sets <S^(a, r)), one 
knows that as n — ► 00 the discrete measures 

converge weakly to the Haar measure /1 on 5(C) having total mass 1. Proving is 
equivalent to showing that 

lim / \ v (a — z) d[i n (z) = 0. 

n ^°° JE(C) 

Choose a lattice A C C such that 5(C) = C/A, and let 5 be the area of a fundamental 
domain for A. After scaling A, if necessary, we can assume that 5 = 1. After this normal- 
ization, fi coincides with Lebesgue measure. Let 9 : C/A = 5(C) be an isomorphism as in 
the David/Hirata-Kohno theorem, and let a G C be a point with 9 (a mod A) = a. 

Fix e > small enough that B(a,e) injects into C/A, and identify B(a,e) with its image 
Be(cl,£) = 9(B(a,e)) C 5(C). (In particular, identify a with a). Without loss, we can 
assume that e < 1/ir, so ns 2 < e. We will show that for all large n, 



(27) I / X v (a-z)dfi n (z)\ < Qs . 

JE(C) 

Put 

— [k v : R] log(|2 — a\/e) if z G B(a,r), 

if z G E(C)\B(a,r) 
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and note that 



< / Q a>£ (z) dfi(z) = / -[k v :R]log(\z-a\/e)dfi(z) 

>E(C) JB{a,e) 



. TIE 2 



2 < £ 



= [k v : R] / -27rtlog(t/e) dt = [k v 
Jo 

By Proposition 13. II A there is a continuous function g a>e (z) on E(C) such that 

\ v (a - z) = Q a:£ (z) + g a>E {z) . 
Since J E ^ \ v (a — z) dfx(z) = (also by Proposition 13. II A), we get 

I / g a , E {z) dn{z)\ = | / -Q a>e (z)dn(z)\ < e. 

JE{C) JB{a,e) 

By weak convergence, it follows that for all sufficiently large n, 



(28) | / g a , £ (z) dn n (z)\ < 2e. 

Je(c) 

To complete the proof of ([27)1 . it will suffice to show that for all sufficiently large n, 

(29) | I log{\z-a\/e)d fJin {z)\ < 2e . 

JB(a,r) 

For this, put D = D n = \[k(£ n ) : A;] 1 / 8 ], and subdivide B(a,e) into a disc A (n) = 
B(a, e/D) and annuli A e (n) = B(a, (£ + l)e/D)\B(a, ie/D) for I = 1, . . . , D - 1. 

For the central disc, we have /j,(A (n)) = ire 2 /D 2 < ne 2 /[k(^ n ) : A;] 1 / 4 . Applying Proposi- 
tion HO] when S is a disc, taking 7 = 3/8, gives 

N(C n ,A 1 (n))/[k{Q:k] < 2/x(A (n)) 

for all sufficiently large n. If £ n has order N n , the David/Hirata-Kohno theorem tells us that 
for each conjugate a(£ n ) G A (n) (where as before we are identifying B(a,e) with its image 
9(B(a,£)) c E(C)) 

|log(|a(e„)-a|)| < Clog(iV„) . 

_____ 1/2 

Using and (jjOj) below, one sees that : fc] > N n for all sufficently large n. Thus 

0<|log(Ke„)-a|)| < 2Clog([A;(e„) : *>]) and 



log([fc(Cn) = A;]) 
[^ n ):A;]V4 



(30) < I / log(|a?-a|)d^(z)| < 4vre 2 C 

for all sufficiently large n. 

For each annulus Ai(n), £ = 1, . . . , D — 1, one has 

/i(4/(n)) = n(2£+ l)e 2 / D 2 = n(2£ + l)e 2 /[k(0 : A;] 1/4 . 

Since ^(n) is the difference of two sets to which Proposition 13.41 applies, we find as above 
that for sufficiently large n, 

N(£ n ,A £ (n))/[k(£ n ):k] < 2fi(A t (n)) . 
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Note that on Ai(n), | log(|2 — af|/e)| < —log(£/D). Summing over these annuli, and 
bounding the resulting Riemann sum by an integral, we find that 

D-l 



\ log(\z-a\/e)dii n (z)\ < ^-log((^)/e).2/i(^(n)) 

J B(a,e)\A (n) £=1 U 

< 2- / -2irt\og(t/e)dt 

JB(a,e) 
.2 



(31) = tte 2 < e . 

Combining ()30|) and ()31|) gives (|29jh which completes the proof of f)26j) in the archimedean 
case (assuming Proposition 13. 4|) . 



3.1.2. The Nonarchimedean Case: In the nonarchimedean case, the proof of (j2oJ) depends 



on a well-known result of Cassels on the denominators of torsion points (see |Si86j , Theorem 
3.4, p. 177). Write O v for the ring of integers of k v . 

Proposition 3.5. (Cassels) 

Let k v be a local field of characteristic and residue characteristic p > 0, and let E/k v be 
an elliptic curve defined by a Weierstrass equation 

y 2 + a\xy + a 3 y = x 3 + a 2 x 2 + a^x + a 6 

whose coefficients belong to O v (N. B. the Weierstrass equation need not be minimal.) Let 
P G E(k v )tors be a point of exact order m > 2. 

(A) If m is not a power of p, then x(P),y(P) G O v . 

(B) Ifm = p n , then x(P) = a/D 2 , y{P) = b/D 3 where a,b,D G O v and 

ord v (p) 



ord„(£>) < 



Proof. Silverman ( |Si86j . Theorem 3.4) states the theorem for torsion points belonging to 
E(k v ), with a,b,D G k v in part B) and D satisfying 

(32) °<MD) = L^rj • 

Since the Weierstrass equation for E need not be minimal, we can replace k v by an arbitrary 
finite extension L w /k v , and if e w / v is the ramification index of L w /k v , then for P G E(L w ) tOTS 
and a,b,D G L w , (J3~2~)) becomes 

(33) ord„P) = — • [ e ^° rd -? J . 

e w/v 1 p n - p n ~ l J 

This yields the result for all P G E(k v ) tors . □ 

Corollary 3.6. Let E/k v be an elliptic curve defined over a nonarchimedean local field. 
Then for each nontorsion point a G E(k v ): 

(A) There is a number M such that for all £ G E(k v )toi S , 

X v (a-0 < M. 

(B) If E has good reduction, then for each e > 0, there are only finitely many ^ G E(k v ) tor s 
with X v (a — £) > e. 
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If E is a Tate curve, then for each e > 0, there are only finitely many £ G E(k v ) tOTS with 
\ v (a-£)>e+±(-log(\A(E)\ v )). 

Proof. After a finite base extension, we can assume that E either has good reduction or is a 
Tate curve. Since (B) implies (A), it suffices to prove (B). Fix e > 0. 

First suppose E has good reduction. Then X v (x — y) = — log(||x, y\\ v ), where y\\ v 
is the spherical distance on the minimal Weierstrass model for E/k v . If £i,£ 2 £ E(k v ) tors 
satisfy X v (a — £j) > e, then ||£ 1; a\\ v , \\£ 2 , a \\v < (Nv)~ £ where Nv is the order of the residue 
field of O v . By the the ultrametric inequality for the spherical distance ( |Ru89j . §1.1), 
||£i,6lk < (Nv)~ £ . By translation invariance ||£i - £ 2 , 0||„ < (Nv)' e . Put £ := £1 - £ 2 . By 
the definition of the spherical distance, if x, y are the coordinate functions in the minimal 
Weierstrass model, 

-log(||£,0y =min(oTd v (x(£)),OTd v (y(0))-\og(Nv) . 
By Cassels' theorem, there only finitely many torsion points for which 

min(ord v (x(£)),ord v (y(£))) > e/log(Nv) . 

Next suppose E is a Tate curve. Fix a Tate isomorphism E{k v ) = k* /q z where \q\ v = 
\A(E)\ V < 1, and let y 2 + xy = x 3 + a 4 (g)x + a 6 (q) be the corresponding Weierstrass 
equation. Let a,Ui,u 2 G k* correspond to a,£i,£ 2 respectively; we can assume that \q\ v < 
\a\ v , \ui\ v , \1i2\v < 1- By the formula for X v (x — y) in Proposition I3.11 B. if X v (a — £«) > 
e + ^(— logflA^)^)), then | a\ v = \ui\ v = \u 2 \ v and 

— log(|l — a" Ui\ v ) = ord„(l — a -1 ^) ■ \og(Nv) > e. 

Put £ = £1 — £ 2 and u = u 2 U\. Then £ corresponds to u under the Tate isomorphism, and 
ord„(l — u) > e/\og(Nv). By the formulas for x(£), ?/(£) in ( |Si99j . p. 425), ord„(a;(£)) = 
2ord 1) (l — u) and ord„ (?/(£)) = 3ordt,(l — u). Again by Cassels' theorem, only finitely many 
torsion points £ can satisfy min(ord t) (x(£)), ord„ (?/(£))) > e/\og(Nv). □ 



We can now prove (|26|) when E has good reduction at v. 

Fix e > 0. Let M be the upper bound in Corollary I3.61 A. and let N be the number of 
points £ G E(k v ) tors with X v (a — £) > e given by Corollary I3.61 B. For all sufficiently large n, 
MN/[k(£ n ) :k]<e, giving 

(T a fXt j FX/ ^ ^ rCu 

([jb^) : jfc] - JV) JV 

[*(&):*] [*(&):*] 



Thus 



o".k j \z c — 



To prove (|26|) when E is a Tate curve at v, we will need the following equidistribution 
theorem of Chambert-Loir ( |CLpp| , Corollaire 5.5). 
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Fix a Tate isomorphism E(k v ) = k v /q z , put L = Z • ord v (q) C M., and define a "reduction 
map" r : £7(&) — > K/L by setting r(P) = ord^a) (mod L) if P G E(k v ) corresponds to 

"T x 

a e k v . 

For each global point P E E(k), define a measure fip tV on R/L by 

= [k(p\ : jfc] Sr ^ p ^ 

and let fi v be the Haar measure on M./L with total mass 1. 

Proposition 3.7. (Chambert-Loir) 

For each sequence of points {P n } in E(k) with h(P n ) — > 0, the sequence of measures 
{fip„, v } converges weakly to fi v . 

We can now prove ()2(ij) when E is a Tate curve. Let {£ n } be a sequence of torsion points 
which are S- integral with respect to a. 

Fix e > 0. Let M be the upper bound in Corollary IH.fil A. Put a = r(a) and let 5 > be 
such that n((a — 5,a + 5)) < s/M, where by abuse of notation we identify a sufficiently short 
interval in R with its image in M/L. By Chambert-Loir's theorem, fi^ nAJ ((a— 5, a+5)) < 2e/M 
for all sufficiently large n. 

By the formulas in Proposition 13. 11 B. J R/L X v (z) dji v (z) = and 



(T.K/ K'~ — *Kv 

< I / \ v (z - a) dfi£ ntV (z)\ + M ■ fi£ n , v ((a — 8, a + 8)) . 
Jr/l 

For sufficiently large n the right side is at most 3e. Hence 

This completes the proof of Theorem 13.21 □ 



Several results in the literature use methods similar to ours, though none of them yields 
Theorem 13.21 

J. Cheon and S. Halm CH99 proved an elliptic curve analogue of Schinzel's theorem 
|Sc74j . Likewise, Everest and B. Ni Flathuin }E F96j evaluate 'elliptic Mahler measures' in 
terms of limits involving division polynomials, obtaining results similar to (|15|). They use 
David/Hirata-Kohno's theorem on elliptic logarithms in place of Baker's theorem, much as 
we do. 

More recently, L. Szpiro and T. Tucker [STpp] proved that local canonical heights for 
a dynamical system can be evaluated by taking limits over 'division polynomials' for the 
dynamical system. (These polynomials have periodic points as their roots). Their work 
uses Roth's theorem rather than Baker's or David/Hirata-Kohno's theorem. It would be 
interesting to see if this could be brought to bear on Conjecture 11.31 
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3.2. Strong equidistribution for torsion points on elliptic curves. We will now prove 
Proposition 13.41 the strong equidistribution theorem for torsion points on elliptic curves 
which was used in the proof of Theorem 13.21 

Proof, of Proposition 13.41 

The proof breaks into two cases, depending on whether E has complex multiplication or 
not. 

First suppose E does not have complex multiplication. 

As usual, the action of Gs\{k/k) on E(k) tOTS gives a homomorphism 

r] : Gal(k/k) -> limGL 2 (Z/iVZ) S JjGL 2 (Z p ) . 

v 

By Serre's theorem ( [Se72j . Theoreme 3), the image of Gai(k/k) in rT^GL^Zp) is open. 
Thus there is a number Q such that lm(rj) contains the subgroup 

Y[(l + QM 2 {Z P )) x J]GL 2 (Z P ) . 

p\Q pK3 

Let Gq C Gai(k/k) be the pre-image of this subgroup. 

Let £ G E(k) tOTS have order N, and put Qn = gcd(Q,N). For suitable right coset rep- 
resentatives 0i,...,<7t of Gq in Gal(A;/A;), the Galois orbit Gal(k/k) ■ £ decomposes as a 
disjoint union of Gg-orbits: 

T 

Gal{k/k)-£ = \jG Q -<Ti(0 ■ 

i=i 

Since Gq is normal in Gal(k/k), the orbits Gq ■ Oi(£) = ct^Gq ■ £) all have the same size. 
Thus [&(£) : fc] = T ■ #(Gq • £). By considering the action of Gq on the p-parts of £, one sees 
that 

#(Gq ■ = ]^[ /KW-^Wn)) . p2ord p (A0^ _ 

p\Qn p\N P 

pIQn 

(34) = ^-na-i)- 

°™ p\N P 

p]/Q 

Indeed, let £ p be the p-component of £ in £[iV] = H plN (Z/p md p (N ^Z) 2 . Identify £ p with an 

element of (Z/p ord ? (7V) Z) 2 , and note that it is a generator for that group. If p\Qn, the image 
of G Q in GL 2 (Z/j9 ord p (7V) Z) is J + p ord ?( Q ^M 2 (Z/p ord ^Z), and 

Gq ■ £ P = £ P + p ort W^ • (Z/p ord p (JV) Z) 2 . 

On the other hand, if p/Qn, the image of Gq in GL 2 (Z/p ordp ( 7V )Z) is the full group, so 

G Q -i P = (Z/p OTd v {N) Z) 2 \p ■ (Z/p ord » (7V) Z) 2 . 

Write Atv = ^A, fix o-j, and let x G A^v correspond to Since -E[AT] = Ajv/A, 

the considerations above show there is a one-to-one correspondence between elements of 
Gq ■ (7j(£), and cosets y + A for y G A^r such that y — x G Qn-^n and y + A has exact order 
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N in Ajv/A. Equivalently, y — x E Qn^n and ?/ ^ pAjy for each prime p dividing N but not 
Q. 

Let pi, . . . ,pr be the primes dividing N but not Q; if there are no such primes, take 
Pi 4 ■ ~Pr = 1- Since Qjv and Pi, • • • ,Pr are pairwise coprime, there is an x G A^v such that 
xq = x (mod QtvAat) and xo = (mod p\ ■ ■ -prAn). Then y — x E QtvA^v if and only if 
y E xo + QnAn, and y G p«Ajv if and only if y G xo + PiA^. Note that if D|pi ■ ■ - pr then 
QnAn H -DAat = QnDKn. Take a G C and < r < r . Using the fact that S(a,r) injects 
into C/A and applying inclusion-exclusion, we obtain 

(35) #(G -a i (0n5 E (a 1 r)) 

= ^ (-l) A ( fl ).#((x + Q w M N )n%r)) 

D\p!-p R 

where \(D) is number of primes dividing D. 

Let T be a fundamental domain for A; we can assume J 7 is bounded and contains 0. Let 
C be such that T C 5(0, C). Note that since 5 is convex, if z\ G <S(ai, r\) and z 2 G 5(a 2 , r 2 ), 
then zi + z 2 G <S(ai + a 2 , n + r 2 ). Put F = area(jF), S = area(5); then area(tjF) = t 2 F and 
area(«S(a, r)) = r 2 S. 

Each lattice QnD An is homothetic to An, and hence has fundamental domain (QnD/N) ■ 
T C 5(0, C • Q N D/N). Write t = Q N D/N, so Q N DA N = tA and tJF C 5(0, tC). As y 
runs over xq + tA, the sets y + tJ 7 are pairwise disjoint and cover C. If y G S(a,r), then 
y + tJ 7 C S(a, r + tC). Hence 



(36) #((x + tA)H5(a,r)) < 



area(5(a, r + £C)) 

area(£F) 
r 2 S 1 2C5r 1 C 2 S 
~F~ ' t 2 ^ F~ ' t + ~F~ 



Similarly, if r > tC, take z G 5(a, r — tC), and let y G xq + tA be such that z G y + tJF. 
Then z — y E tJ 7 , so z — y G 5(0, tC), and since 5 is centrally symmetric y — z E 5(0, tC). 
Thus y = z + (y - z) E S(a,r). It follows that 5(a, r - tC) C LU(c Co +tA)ns(a,r) (2/ + so 

, , „.. area(5(a, r — tC)) 

(37) #((x + tA)n5(a,r)) > - 



> 



area(tJF) 
r 2 S 1 2CSr 1 C 2 5 



If r <tC then the right side of (J37)) is negative, so the inequality between the first and last 
quantities holds trivially. 

Replacing t by its value QnD /N and combining (|HHJ), ((3ZJ), we obtain 

i ,,// ^ ^. n nn area(5(a, r)) A^ 2 , 

#((x + Qi VJ DA 7 v)n5(a,r))- 



area(JF) Q%D 2 



^ ' " F QnD F 
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Inserting ()38|) in the inclusion-exclusion relation (J33j) and summing over all cxj(£), i 
1, . . . , T, we find 

»r/>. ^ / w area(<S(a, r)) TiV 2 -i-t l s 



p|iV,p)Q 



AN 



where as before, denotes a quantity with — x < 9(x) < x. By 

t /v 2 

(39) [*(£):*] = T-#(G Q -0 = — H 



Qn p\N, P iQ 



Since r < r , it follows that 



N(C,S E (a,r)) area(S(a,r)) { ^,2CSr Qn 



[k(t):k] " area(F) ^ F ^n^W 1 ^) 



,C 2 S 2 R Q 2 



i 



Here area(iS(a, r))/area(.F) = //(<Ss(a, ?*)). Note that T < #(GL 2 (Z/QZ)) is bounded, 
Qat < Q is bounded, and ^71^(1 — -) > N 1 " 5 for each e > and each sufficiently large 
N. Using and the fact that 1 > n p |Ar P }Q(l — tO — VC(2) one sees that the first error 
term is 7 ([fc(f) : A;]" 7 ) for each 7 < 1/2. Similarly, 2 R < d(N) < N e for each e > and 
each sufficiently large N. Thus the second error term is negligible in comparison with the 
first. 

This completes the proof of Proposition 13 .41 when F does not have complex multiplication. 

Now suppose E has complex multiplication. Let K be the CM field, and let O C Ok be 
the order corresponding to E. After enlarging k if necessary, we can assume that K C k. 
Let A C C be a lattice such that E = C/A. Without loss of generality, we can assume that 
A C K. Fix an analytic isomorphism d : C/A = F(C). 

By the theory of complex multiplication (see |Sh71j . |L73j . or [Si99 , Chapter II), E{k) tors 
is rational over k ab , the maximal abelian extension of k. Let k^ be the idele ring of k, and 
for s £ k^ let [s, k] be the Artin map acting on k ab . Given a G G&l(k/k), take sG^ with 
a\ k ab = [s,k], and put w = N^/x{s) G K^. There is an action of K£ on lattices, defined 
semi-locally, which associates to w and A a new lattice w~ 1 A. This action extends to a map 
w^ 1 : K/A — > K/w~ 1 A. There is also a homomorphism ip : k^ —>■ K x , the 'grossencharacter' 
of F, which has the property that , 0(s)A^ fc /^(s) _1 A = A. Put k = if)(s) G F x . 

With this notation, there is a commutative diagram: 



w 



K/A - 


-> C/A 


* F(/c)tors 


I 




i 


K/w^A <- 


-> C/w^A 


> F(/c)tors 


« 1 




I ^ 


F/A - 


-> C/A 


* F(fc) tors 
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in which the vertical arrows on the left are multiplication by w~ x and k respectively, and those 
on the right are the Galois action (see [Sh71j. Proposition 7.40, p. 211, or |L73j . Theorem 8, 
p. 137). Note that the same analytic isomorphism •§ appears in the top and bottom rows. 
Thus, if £ G E(k) tOTS corresponds to x G K/A, and a\ k ab = [s, k], then 

This gives an explicit description of the Galois action on torsion points in terms of adelic 
"multiplication" . 

The action of K£ in the diagram is as follows. Let L C K be a lattice. For each rational 
prime p of Q, write L p = L®% Z p and K p = K ®q Q p ; if w G K£ , let w p be its p-component. 
Then w~ x L p is a Z p -lattice in K p . There is a unique lattice M C K such that M p = w p x L p 
for each p ( |L73j . Theorem 8, p. 97), and w~ x L is defined to be M. Likewise, if x G K/L, 
lift it to an element of K C K& and write x p G K p for its p-component; there is a y G K 
such that w~ l x p (mod w~ x L p ) = y (mod M p ) for each p, and w _1 (a; (mod L)) is defined to 
be y (mod M). 

The order (9 has the form O = Z+cOk for some integer c > 1, and c is called the conductor 
of O. The lattice A is a proper (9-lattice, meaning that O = {x G K : a; A C A}. For any 
order O, there are only finitely many homothety classes of proper CMattices ( |L73j . Theorem 
7, p.95). Write O p = O ® z Z p and KiP = O k ® z Z p . If p fc, then O p = C^ p £ n„ b ^p, 
where p runs over the primes of if lying over p, and Ox,p is the completion of Ok at p. 

The kernel U of the grossencharacter ip : — > K x is open in so its image = 
N k / K {U) C ifjj is open. Thus there is an integer Q > 1 such that for each p|Q, the subgroup 
1 + QO K>p C Kp is contained in Wp and for each p jf Q, Kp C W p . If w G W, then 
w^A = A, so io p G O p . Hence c|Q. 

Noting that O p = KiP iip/Q, let Wq C lf£ be the subgroup 

c x x \{{i + qo p )x \\o; C W , 

p\Q pJQ 

and let Uq be its preimage in k^ under the norm map. Put 

G Q = {a G Gal(k/k) : cr|£; a6 = [s, k] for some s G . 

Then Gq is open and normal in Gal(k/k). 

Fix £ G E(k) tors . Suppose £ has order N; put Qtv = gcd(Q,A^). For suitable right coset 
representatives oi, . . . , a? of Gq in Gal(/c/fc), the orbit Gal(/c//c) ■ £ decomposes as a disjoint 
union of GQ-orbits: 

T 

Gal(*/A;)-£ = \JGq-°M) ■ 
i=i 

As before, the orbits Gq-g^E) = ct,;(Gq-£) all have the same size, and [&(£) : fc] = T-#(Gq-£). 

Let £ correspond to i + A 6 if/A. Write A(x) for the O-lattice Ox + A; since £ has order 
iV, [A (a:) : A] > N. More generally, for any integer m, put A(mx) = O ■ mx + A = mOx + A. 
Note that 

A(mx)/A = Y[A(mx)p/A p = Y\(mO p x + A p )/A p . 

p\N p\N 
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If p\Q, then Gq acts on £ p through the subg roup 1 +p OTd p^O p C 0*. Noting that 
ord p (Q N ) = min(ord p (Q),ordp(iV)) and that p OI MQ) x e \ p if ord p (Q) > ord p (iV), we have 

G Q • £ P = (x + p ord ^ Q) O p x + A p )/A p = (x + A(p old ^x) p )/A p . 

Thus • Q = [A(p°*MQN) x ) p : A,]. 

If p | Q, then O p = K , P and Gq acts on £ p through £>* = n^^p- For each 
p\p, and each O-lattice L, we have L p = (OkL) p where OkL is an (9^-fractional ideal. 
Thus ordp(L) := ord p (C^L) is well defined. Write ord„(£) = ord p (A) - ord p (A(x)). Then 
A(x) p /A p - n p | P ^/P ordp(5) and 

• Q = [A(x) p : Ap] • J] (1 - ^) 

p|p 

ordp (£)>0 

where Np = #(Ok/p) is the norm of p. 

Combining these formulas, and using that Y[ P \N[A(p° Tdp ^ QNS>x )p '■ A p ] = [A(Q N x) : A], we 
obtain 

(40) #(G Q • = [A(Q N x) : A] • J] (l - . 

ordp (£)>0 

If L is any (9-lattice, and F(L) is the area of a fundamental domain for C/L, then by 
Minkowski's theorem there is a point / f G i with |£| < (4/7r) 1 / 2 F(L) 1 / 2 . Here, L is 
a proper O'-lattice for some order O' with conductor c'|c. There are only finitely many 
such orders O' , and for each O' there are only finitely many homothety classes of proper 
O'-lattices, so there are only a finitely many homothety classes of 0-lattices. Hence there 
is a constant Ci, independent of L, such that L has a fundamental domain ^"(L) contained 
in the ball B(0, C\ ■ F(L) l l 2 ). In turn, there is a constant C, independent of L, such that 
F{L) C 5(0, C ■ F(L) 1 / 2 ). This fact is the crux of the proof. 

Again, if L is an (9-lattice, then for each ideal zu of Ok coprime to c, there is a unique lattice 
zuL defined by the property that (zuL) q = {wOkF) q for all primes q\Nzu, and (zuL) g = L q 
for all primes q jf ' Nw. This lattice has index [L : wL] = Nw. 

Now consider a set S(a,r), where a e C and r < r . For each <7j(£), we will compute 
• o-j(O) ^ <SE(o-,r))- Fix (Tj, and replace £ by <7j(£) in the discussion above. Let 
x E K/A correspond to <7j(£), and let p 1: . . . ,p R be the primes of O k dividing N but not 
Q, for which ord p (A(x)) ^ ord p (A). (Note that the pj are independent of <7j, since K C k 
and for p jfQ, Oi acts on £ through 0* .) Then there is a one-to-one correspondence between 
elements of Gq • <7j(£), and cosets y + A for y e K such that + A(<5at:t) and y ^ pjA(x) 
for j — 1, ... , R. Since A(<3jv#) C A(x), such y necessarily belong to A(x). 

The lattices A(Q N x) and pi • • -PrA(x) have coprime indices in A(x), so there is an x G 
A(x) such that a; = x (mod A(Qjv£)) and x = (mod pi • • -p^A(a;)). Further, for any 
Ox-ideal w dividing pi • • -p R , 



A(Q N x) p| ( p| pjA(x)) = zuA(Q N x) 
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Clearly y G x + A(Q N x) if and only if y G x + A(Q N x), and y G pjA(x) if and only if 
y G xq + pjA(x). Since S(a,r) injects into C/A, by inclusion-exclusion 

(41) #((G Q -a,(0)n<^(^)) 

= Yl (-l) XK{ ^-#((xo + ™HQNx))f)S(a,r)) 

m\pi-p R 

where \k(^) is the number of prime ideals of Ok dividing w. 

Take L = wA(Qnx), and note that its fundamental domain ^(L) has area F(L) = 
F ■ Nva /[A(Qnx) : A], where F is the area of a fundamental domain JF for A. By the same 
argument leading to (j4TiJ) we find 

..^ area(<S(a, r)) [A(Qatx) : Al , 

1 areafy-) Atu 1 

2C\Sr r [A(g^) : A] 1/2 C 2 S 

Here the index [A{Q N x) : A] is independent of cxj by (jlUj) . since #(Gg ■ o"j(£)) and the p., 
are independent of <X;. Inserting (|42j) in the inclusion-exclusion formula (J41|) and summing 
over all <7j(£), 

»r/>. ^ / „ area(5(a, r)) mr ,,^ , . -, -tt / 1 \ 

Ar(f ^ (o - r » = ^r - r ' A '^ :A 'n(i-^) 

+ »(^-T[A(O w ,):A]V»n(l + -U )) 
By (HDD, [*(£) : k] = T[A(Q N x) : A] Uf^l-l/Npj). Since r < r and nf=i(l + Wp 1/2 ) < 



N(£,S E (a,r)) area(<S(a, r)) 
[£;(£) : fc] area(jF) 

1 ^ '(njLi(i-v^Pi)) i/2 '[^)^] l/2j 

(43) 

As before area(5(a, r))/area(^ r ) = h(Se(ci, r)). Here T < [Gal(/c/A;) : Gq] is fixed. For 
each s > and each sufficiently large JV, 2 R < 2 Ak(n *> < 2 2X(N ^ < d(N) 2 < N £ . Likewise, 
Ilf=i( 1 - 1 / N P) > YlpiA 1 ~ 1 /Pf > C / {\og\og{N)f for some constant C > 0, where the 
last inequality follows from f |HW71j . Theorem 328, p. 267). Finally, since £ has order N and 
Qn < Q is bounded, [A^^rx) : A] > N/Q, and so 



(44) : k) > T-N/Q- C/(loglog(iV)) 2 > TC/Q ■ N 



1-e 
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for all large N. Combining these shows that for each < 7 < 1/2, the first error term is 
7 ([/&(£) : fc] -7 ). The same estimates show the second error term is negligible in comparison 
to the first. 

This completes the proof of Proposition 13 . 41 when E has complex multiplication. 

Before closing, we note for purposes of reference that the arguments above provide lower 
bounds for the degree : k] in terms of the order N of £. When E does not have complex 
multiplication, then since T is fixed, Qn < Q, and Yl p (l — l/p 2 ) converges to a nonzero 
limit, (|39|) shows there is a constant C\ depending only on E such that 

(45) [k(0 : k] > C^ 2 . 

When E has complex multiplication, then since T and Q are fixed, f)44j) shows that there is 
a constant C 2 depending only on E such that 

(46) [*(£) : k] > C 2 N/(\og\og(N)) 2 . 

□ 
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